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Abstract 

^ In this paper we study the classic problem of computing a maximum cardinality matching 

*vj ' in general graphs G = {V,E). This problem has been studied extensively more than four 

J- p.. decades. The best known algorithm for this problem till date runs in 0{my/n) time due to 

^ \ Micali and Vazirani [2lj. Even for general bipartite graphs this is the best known running 

•^ ' time (the algorithm of Karp and Hopcroft [16] also achieves this bound). For regular bipartite 

graphs one can achieve an 0{m) time algorithm which, following a series of papers, has been 

recently improved to 0(n log n) by Goel, Kapralov and Khanna (STOC 2010) [15]. In this 

paper we present a randomized algorithm based on the Markov Chain Monte Carlo paradigm 

C/^ ■ which runs in 0(?7ilog n) time, thereby obtaining a significant improvement over [24|. 

q 

lyj I We use a Markov chain similar to the hard-core model for Glauber Dynamics with fugacity 

, ^ , • parameter A, which is used to sample independent sets in a graph from the Gibbs Distribution 

|31j . to design a faster algorithm for finding maximum matchings in general graphs. Motivated 
by results which show that in the hard-core model one can prove fast mixing times (for e.g. 
(^ ' it is known that for A less than a critical threshold the mixing time of the hard-core model 

OO . is 0(n log n) [27], we define an analogous Markov chain (depending upon a parameter A) on 

the space of all possible partial matchings of a given graph G, for which the probability of a 
particular matching M in the stationary follows the Gibbs distribution which is: 
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i^ ■ where is the set of all possible matchings in G. 

H 

Cu I We prove upper and lower bounds on the mixing time of this Markov chain. Although our 

Markov chain is essentially a simple modification of the one used for sampling independent sets 

from the Gibbs distribution, their properties are quite different. Our result crucially relies on 

the fact that the mixing time of our Markov Chain is independent of A, a significant deviation 

from the recent series of works |1H [26l [28l [29l [30] which achieve computational transition (for 

estimating the partition function) on a threshold value of A. As a result we are able to design a 

randomized algorithm which runs in 0(m log n) time that provides a major improvement over 

the running time of the algorithm due to Micali and Vazirani. Using the conductance bound, 

we also prove that mixing takes ^(^) time where k is the size of the maximum matching. 
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1 Introduction 

Given an unweighted undirected graph G = {V, E) with \E\ = m and |y| = n, a matching M 
is a set of edges belonging to E such that no two edges in M are incident on a vertex. If there 
is a matching of size n/2 (for n even), then it is cahed a perfect matching. The Maximum 
Matching problem is to find the maximum sized matching in a given graph. The computational 
complexity of this problem has been studied extensively for more than four decades starting 
with an algorithm of Edmonds. 

1.1 General Graphs 

Edmonds's celebrated paper 'Paths, Trees and Flowers' [9] was the first to give an efficient 
algorithm (also called the blossom shrinking algorithm) for finding maximum matching in 
general graphs. This algorithm can be implemented in 0{n^) time. The running time was 
subsequently improved in a number of papers [TUl [2U1 [22]. All these papers were variants of 
Edmonds algorithm. Even and Kariv [12] obtained an improvement to 0{v?'^) which was 
improved by Micali and Vazirani |24] who gave an 0{my/n) time algorithm for the problem 
by a careful handling of blossoms. This is the best known algorithm for finding maximum 
matchings in general bipartite graphs. 

1.2 Bipartite Graphs 

For bipartite graphs, the problem can easily be solved using the max-fiow algorithm by Ford 
and Fulkerson, an algorithm usually taught in an undergraduate algorithms course [21], which 
has a running time of 0{mn). The first algorithm for this problem was given by Konig |19j . 
Hopcroft and Karp [16] gave an algorithm that runs in 0{m^/n) time. This algorithm is an 
exact and deterministic algorithm. The problem becomes significantly simpler for regular bi- 
partite graphs. In a d-regular bipartite graph every vertex has degree d. When d is a power 
of 2, Gabor and Kariv were able to achieve an 0{m) algorithm. After significant efforts, the 
ideas used there were used by Cole, Ost and Schirra [4J to obtain a get an 0{m) algorithm for 
general d. 

In a recent line of attack by Goel, Kapralov and Khanna [13[ 114] . the authors were able 
to use sampling based methods to get improved running time. In the most recent paper they 
were able to achieve a running time of 0(n log n) for d-regular graphs |15j . Their algorithm 
performs an appropriately truncated random-walk on a modified graph to successively find 
augmenting path. 

2 Our Results 

In this paper we give a Markov Chain Monte Carlo algorithm for finding a maximum matching 
in general bipartite graphs. Our algorithm is in the spirit similar to [IH] which also is a 
'truncated random walk' based algorithm, however the stationary distribution of the underlying 
Markov Chains in their case is different from ours. Inspired from the hard-core model with 
fugacity parameter A of sampling independent sets from graphs we define a similar Markov 
Chain over the space of all possible partial matching such that its stationary distribution 7r(-) 



is the Gibbs distribution, ie. given a matching M its probabihty vr is 
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where ri is the set of all possible matchings in G. Notice that 7r(M) is maximum for maximum 
matchings and if A is a significantly large number, tt{M) tends to 1 for maximum matchings. 
Our algorithm is extremely simple (being a standard in the MCMC paradigm). Starting from 
a fixed matching we start a random walk in 0, according to the underlying graph G of the 
Markov chain. After Tmix (mixing time) steps the distribution reached by the algorithm is 
roughly the same as the Gibbs distribution. More formally, the variation distance of P* (the 
distribution after t steps) from the Gibbs distribution is less than ^. This just leaves the task 
of proving an upper-bound on the mixing time of the Markov Chain, for which we resort to 
the Coupling Method introduced by Bubley and Dyer [2]. We define a metric <!>(•, •) and apply 
the bound from [2j, to show that the Markov Chain mixes in 0(m log n) time. We also use the 
conductance method to show that the mixing time will be at least ^(^) where k is the size of 
the maximum matching. Thus upto logarithmic factors the bounds are same when k is small 
(at most polylogarithmic). The main result of our paper can be concisely written as follows: 

Theorem 1 (Main). There exists a randomized algorithm which given a graph G = {V,E) 
with \V\ = n and \E\ = m finds a maximum matching in 0{m\og n) time with high probability. 

Important remark: It has been pointed to us independently by Yuval Peres, 
Jonah Sherman, Piyush Srivastava and other anonymous reviewers that the cou- 
pling used in this paper doesn't have the right marginals because of w^hich the 
mixing time bound doesn't hold, and also the main result presented in the paper. 
We thank them for reading the paper with interest and promptly pointing out 
this mistake. 

2.1 Organization 

The paper is organized as follows: in Section [3] we give a brief description of the basic idea and 
technique that are underlying our algorithm. Subsequently in Section SI we give an overview 
of the MCMC paradigm which includes basic preliminaries and definitions regarding Markov 
Chain and Mixing. Section 7 is devoted to the details of the chain being used by us and 
proving that indeed it has the desired properties. We then prove upper and lower bounds on 
its mixing time using the coupling method and conductance argument in Section 8 and Section 
9 respectively. We end the paper with a conclusion and some open problems. 

3 The Idea: Maximum Matchings via Glauber Dy- 
namics 

Our ideas are inspired mainly from the results in the Glauber dynamics of the hard-core model 
of fugacity parameter A, for sampling independent sets from the Gibbs distribution. According 
to the Gibbs distribution the probability of an independent set / is given by 

Q{i) ^ 
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where Q is the set of all possible independent sets in G and Z = X^ogq '^ is also called the 
partition function. Clearly for A = 1 the partition function value is same as the number of 
independent sets in the graph (computing which is a #P-Hard problem). 
In the hard-core model given a particular configuration a of an independent set (cr can be 
thought of an n-dimensional 0/1 vector which is 1 for all the vertices which are in the inde- 
pendent set and otherwise) , we choose a vertex randomly uniformly, if this vertex is already 
present in the independent set we keep it with probability jj^ and discard it with probability 
j^ otherwise the vertex is not in the independent set and if this vertex can be added to the 
independent set (i.e. none of its neighbors are already present in the independent set) then 
again it is added with with probability j^ and rejected with probability j^. The beauty of 
this Markov Chain is that the stationary distribution is the Gibbs distribution. The details of 
this chain can be found in j31l . 

The key difference between the mixing time of Glauber Dynamics for independent sets and 
our case is that one can achieve fast mixing time in the former case only for small values of 
A. In fact intuitively one should not be able to obtain fast mixing times for large values of A 
because such a result would imply that we can design a randomized polynomial time algorithm 
for finding maximum independent set in a graph, which is an NP-Hard problem. This intuition 
has led to a series of papers [HI [261 ESI EHl EO] which ultimately has been successful in proving 
that there exists a threshold value of A = Ac such that if A > Ac then estimating the partition 
function is hard (the exact technical condition is that unless NP = RP no FPRAS exists for 
estimating Z) and for A < Ac one can obtain an FPTAS for the same problem. Previous to 
this result, the computational complexity of estimating the partition function (and counting 
the number of independent sets) was only understood for special graphs |32]- Apart from 
these results substantial attention has been given to obtain good bounds on the mixing time 
of this chain for trees |28] . 

We define a Markov Chain which is tuned to our need. In our case u is a set of edges which 
form a partial matching. We make a simple modification to the above chain, wherein instead 
of picking a random vertex we pick a random edge Cr G E and perform the same experiment 
with the parameter A as in the case of independent sets (notice that the chosen edge won't 
be added if in the present matching there is an edge sharing an end point with e^). We then 
show that this chain is aperiodic and irreducible with stationary as the Gibbs distribution over 
the space of all possible partial matchings with parameter A. We then use the techniques of 
bounding the mixing time to achieve a A independent upper bound. This remarkable property 
allows us to exploit the nature of Gibbs distribution (which we obtain for very large values of 
A) without getting an overhead on the mixing time. 

3.1 Markov Chain Monte Carlo 

Markov Chain Monte Carlo algorithms have played a significant role in statistics, econometrics, 
physics and computing science over the last two decades. For some high-dimensional problems 
in geometry, such as computing the volume of a convex body in d dimensions, MCMC simula- 
tion is the only known general approach for providing a solution within time polynomial in d 
[6]. For a number of other hard problems like approximating the permanent [18], approximate 
counting y^, the only known FPRASs (Fully Polynomial time Randomized Approximation 
Schemes) rely on the MCMC paradigm. In this paper, we use this method to obtain a faster 
algorithm for the classical problem of finding maximum matchings in general graphs, a prob- 
lem which is known to be solvable in polynomial time. 



The Markov Chain Monte Carlo (MCMC) method is a simple and frequently used approach 
for sampling from the Gibbs distribution of a statistical mechanical system. The idea goes 
like this, we design a Markov chain whose state space is Q whose stationary distribution is the 
desired Gibbs distribution. Starting at an arbitrary state, we simulate the Markov chain on Q 
until it is sufficiently close to its stationary distribution. We then output the final state which 
is a sample from (close to) the desired distribution. The required length of the simulation, in 
order to get close to the stationary distribution, is traditionally referred to as the mixing time 
T or Tmix and the aim is to bound the mixing time to ensure that the simulation is efficient. 
For a detailed understanding of the theory of Markov Chains we would recommend the recent 
excellent book by Levin, Peres and Wilmer |23j . 

4 Preliminaries 

4.1 Markov chains 

Consider a stochastic process (Xi)^Q on a finite state space il. Let P denote a non-negative 
matrix of size \0,\ x \Q\ which satisfies 

Y^ Pij = 1 Vi G 17 

The process is called a Markov chain if for all times t and i, j G i7 probability of going from 
i^^ state to j^^ state is independent of the path by which i*^ state is reached i.e. if Xt is the 
state of the process at time t then 

V[Xt+i\Xt = xuXt-i = xt-i . . . Xo = xo] = nXt+i\Xt = Xt] 

A distribution vr is called a stationary distribution if it satisfies vrP = vr. A necessary and 
sufficient condition for a chain to have a unique stationary distribution is that the chain is 

1. Irreducible: for alH, j G il there exists a time t such that P/- > 0; and 

2. Aperiodic: for ah i € Vi, GCD {t : i^* > 0} = 1. 

A Markov Chain which has both of the above properties is called ergodic. For an ergodic 
Markov chain, if a distribution vr satisfies the detailed balance equations 

for all i,j € Q then vr is the (unique) stationary distribution and such a chain is called reversible. 

4.2 Mixing Time 

The notion of mixing time is defined as a way to measure the closeness of the distribution after 
t steps w.r.t. the stationary distribution. The total variation distance between two discrete 
probability distributions over a finite space fl is defined as the half of the ^i norm of the 
corresponding probability vectors. 
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If Pt is the probability distribution after t steps in the random walk then T^ix is the minimum 
t for which, 

dTviPt,7r)<^ 
ze 

where vr is the stationary distribution. Therefore, if we intend to get close to a stationary 

distribution we just have to truncate the random walk on the state space after r steps. 

4.3 Conductance 

The conductance of a Markov chain is defined as the following quantity. 

Another quantity of our interest here is Treiax, the relaxation time of the Markov chain. Treiax 
is defined as the inverse of conductance i.e. T^eiax = tIg)' ^^ ^^ known that T^ix and Treiax 
obey the following inequality [Ij . 

-^ relax ~r J- _; ^mix- 



-^ relax — ^M / 



Also it is known 1281 that 



a bound usually used to prove lower bounds on the Mixing time of Markov Chains. 

5 The Chain 

In this section, we describe the chain considered by our algorithm which is essentially a modi- 
fication of the hard-core model of Glauber Dynamics. Recall that our objective is to come up 
with a chain whose stationary distribution ensures that the probability of being at a maximum 
matching is the largest. Recall that first we need to ensure that our chain is aperiodic and 
irreducible. 

We use the following natural modification of the Markov chain for the hard-core model of 
Glauber dynamics. 

• Choose an edge e^ uniformly at random from E. 

• Let 

A 



a' 



(7|J{er}, with probability j—i^ 
a\{er}, with probability j^ 

• If a' is a valid matching, move to state a' otherwise remain at state a. 

We are now prepared to prove that this is a valid Markov Chain with stationary as the 
Gibbs distribution, where we define Gibbs distribution as the following distribution over the 
set of all possible matchings 0, as 

Q{M) 



In the rest, we will call Z = Ylixen ^ 



Lemma 1. Xt is ergodic with stationary distribution as Q. 

Proof. Since for every state there is some probability by which the walk can remain in the 
same state, the chain is aperiodic. Also the underlying graph is connected because given any 
matching there is a at least one path to reach any other matching (consider the path that first 
drops all the edges of the initial matching and adds the edges of the new matching one by 
one). 

To show that the stationary of this distribution is ^, we would show that the chain is reversible 
w.r.t. Q. Consider two distinct states i and j in the Markov Chain. Assume w.l.o.g that there 
is an edge from i to j (if there is no edge from i to j then the balance equations corresponding 
to i and j are trivially satisfied) . Let the configuration i has t edges in it then according to our 
construction, j will have either t + 1 or i — 1 states (it won't be t because we have assumed that 
the two states are distinct). We will just look at the case when j has t + 1 edges, the other case 
is analogous. Since we are interested in showing reversibility w.r.t. Q, T^i = ^ and ttj = —z~- 
Therefore, TuPij = $ • ;;^^, and HjPji = ^ • ^^^^. Thus i^iPij = TTjPji = ^^^^ 

D 

6 Upper Bound using Coupling 

In this section we prove an upper bound on Tmix of the chain defined in the previous section. 
Our bound is based on the coupling argument introduced by Bubley and Dyer fl\. We first 
give a description of the coupling method. 

6.1 Coupling Method 

A coupling of a Markov chain on state space is a stochastic process (o"t,7/j) on \Q\ x |il| such 
that: 

• (Tj and r]t are copies of the original Markov chain and 

• if o-t = 7?t, then at+i = ??t+i 

Thus the chains follow each other after the first instant when they hit each other. In order to 
measure the distance between the two copies of the chain, one introduces a distance function 
$ on the product state space x fi so that $ = <I>((Tt,r/f) = <^=^ at = rjf. For two states 
o" and r/ let p{a, rf) be the set of all paths from a to ?? in the Markov Chain. The following 
theorem due to to Bubley and Dyer is used to prove mixing time on Markov chain. 

Theorem 2. Let <I> he an integer-valued metric defined onQxQ which takes values in {0,1... D} 

such that, for all a,ri £ 0, there exists a path ^ G p{a, rf) with 

i 

Suppose there exists a constant /3 <1 and a coupling {at,rjt) of the Markov chain such that, 
for all at,7]t, 

E[<^{at+i,rit+i)] </3$((Tf,r/i) 



Then the mixing time is bounded by 

^ log(2eD) 

- 1-/3 

Proof. Can be found in [21 EI]- □ 

In order to bound the mixing time, we will define a coupling so as to minimize the time 
until both copies of the Markov chain reach the same state, and we will do that by defining the 
coupling in such a way that on every step both markov chains reach towards same state. The 
aim is to prove a good upper bound on E[^{at+i,i]t-\-i)] in terms of ^{at,rjt). In the following 
subsection we define the coupling: 

6.1.1 Coupling 

Consider the following process {at,r]t) on |il| x |ri| where Q is the space of all possible matchings. 

Definition 1. Choose an edge uniformly at random, 

1. If insertion is possible in both a and rj then add it probability j^ and remove it with 
probability jjrj^- 

2. If insertion is possible in one and not possible in the other then remove that edge if it is 
already present in one matching. 

Notice that here we are relying on the fact that one can insert an edge if it is already 
present in it. It is easy to verify that this indeed is a coupling for the Markov chain defined in 
the previous section. 

We use the following distance function for the aforementioned coupling. Let 

0"©7/=<eG£'|eG [{<^ \ V)\\{'>1 \ ^)) \ where a and r/ G f] 

define the distance function <I>((Tt, r/t) = |(T® 7/| which is the number of edges present in one but 
not in the other. Our objective is to upper-bound E[^(^at+i,r]t+i)] in terms of dt = ^{at,r]t). 
Based on the definition of our coupling the following cases may arise once we pick an edge Cr 
uniformly randomly: 

1. Cr can be added to both at and r]t: The subevents are (a) er was present in both 
of them, in this case ^{at+i,r]t+i) = dt, (b) Cr is not present in both of them, in which 
case again ^{at+i,r]t+i) = dt and (c) e,. is present in one but not in other, in which case 
$(o-t+i,T/t+i) = dt-1. 

2. Cr can be added in exactly one of at and rjt: The sub events for this case are (a) 
Cr is not present in both, which gives ^{at+i,r]t+i) = dt {h)er is present in one matching 
and not in other, in which case ^{at+i,'r]t+i) = dt — 1 

3. Cr can't be added to any one of at and r]t: In this case ^{at-\-i,rit+i) = dt- 
Using the above events we can prove the following: 

Lemma 2. E[<^{at+i,r]t+i)] = ^{at,Vt) (1 

V m 



Proof. Since $(o"(+i, 774-1-1) can only take two values either d or d — 1 we only need to calculate 
the the probability of the happening of one of these cases. This happens when either the event 
1(c) or the event 2(b) takes place (as mentioned above). Thus, 

Pr[$(at+i,r?i+i) = dt - 1] = Pr[l(c) U 2(6)] 

clearly the distance becomes df — 1 when the chosen edge is one of the edges in o" © r/. We can 
divide o" © 7/ in to two sets U and U. U is the set of edges which can be added to the matching 
in which it is not present, and U is the set of edge which can't be added to the matching in 
which it is not present. Using this notation we can write the desired probability as 

Pr[i(c)u2(6)] = y^l(-J- + -J-]+y^l. 

|ct©?7| _ $(at,77t) 



m m 



Therefore, 



E[<^{at+i,'nt+i)\ = [dt-l) ha* 1 



m \ m 

$(crt, r]t){l ) ( since dt = ^{at, m)) 

m I 



U 



We can now prove the following: 

Lemma 3. T^ix = 0{m\ogn). 

Proof. Given any a and r/ we define a d = <1>((T, ry) length path as cr = ^^ , ^^ ..., ^"^ = r/ such that 
^*+^ is the state obtained by removing exactly one edge from ^* G o" n('7 © ??) for i = 1, 2 ... j 
where ^-^ consists only of edges which do not belong to (T©r/ and for all k > j, ^^^^ is obtained 
by adding one edge to ^^ which belongs to r/ P|(cr©r/). Since CjC~^^ = 1 for alH = 1, 2 ... d — 1, 
we have 

$(a,r?) = ^$(f,r+i) 



Also we can write 

1 



Emat+i,vt+i)] = l3H'Tt,Vt){ with ^ = 1 



m 



this allows us to apply the result from Theorem [2] which gives the following result 

Tmix < 7 Tx- ^ rnlog(4en) = 0(m log n) 

1- 1-- 

V "^z 

where we have used /3 = (1 — 1/m) and D = 2n. D 



Our algorithm is concisely presented as follows: 



a 



Input: A Graph G = {V, E) with \V\=n and \E\ 

(Jo -^ any matching; 

A = 2"^; 

for t = to 10m log n do 

choose an edge e^ uniformly randomly ; 

{o'tUi^r}, with probabilityYT-j^ 
at\{er}, with probabilityjq^ 
if a' is a valid matching; 
<7t+i = cr'; 
else; 

o"t+i = (Tt; 
end 
return at 



m 



Algorithm 1: RandMatching 

Theorem 3. The probability that the random walk in AlgorithmUl ends on a maximum match- 
ing is at least j^. 

Proof. Let Mi be the set all of matchings of size i (and Si be its cardinality) in the given 
graph. Also, let k be the size of the maximum matching. We need to find the probability that 
the random walk lands up on a maximum matching after T = 10m log n > Tmix steps. Let 
Xt be the state after T steps, and vry be the probability distribution after T steps then by 
definition of mixing time and triangle inequality. 



Y^ ttt{uj) - Y^ g{uj] 



< Y kT(w)-gH| < ^ivTTH-aHl <- 



ujeMk 



jen 



where Y2ui<^M t('^) := Prfc(T) is the probability of reaching a maximum matching after T 
steps (the success probability of the algorithm) and ^^^j^ G{uj) := Fik{G) is the probability 
of finding a maximum matching according to the Gibbs distribution. We need to find the 
probability that Xt is a maxiinum matching. Thus we have. 



Prfc(7rr) G 



Prfc(a) 



-,Prfc(g) + - 
e e 



Also, 



Prfe(a) 



k 



j=0 



A Sk 



ik-l 



k-2t 



X Sk-\- X Sk-i + A Sk-2 + ■ ■ ■ + So 



dividing by A both numerator and denominator 
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Sk 

5fc + ^ + %^ + ... + ^ 



-^^^?- 



Afc 



We put A = Sm where S'm=max Si 



Sk 



1 



'-'fc + c* + 02 ~^ ' ' ' ~^ qk ~^ q, q ~^ q, q2 ~^ ' ' ' ~^ q, qk 



S^ 



By definition of Sm, 5^ is always < 1, hence we have 



1 _|_ "^fe-l I ^k-2 _|_ "^O 



SkSm '-'fc'-'m. 



'-'fc'-'m 



> 



> 



> 



1 



111 1 



'-'A; SkSm '-'fc'-'m SkSm 

1 
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^k \ ^-^m. ^^m iJm 



■"m ^m 



1 



1- ^ 



1 + 



Sk\ 1 - ^ 



> 



1 + 



5—1 



•J A; \Om (iJm — ly 



Since 



cfc 



"Yfc-1 



'-'m l.'-'m -L^ 



1 + 



5 



fe-1 



^fc-l 



'-'m. (.'-'m IJ 



^+^'i:)^^ 



PrA:(a) > 



1 + 



1 10 

-^^21 
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This gives us, 



P,,(,,) > 15 _ i > ^ 

^ ^ - 21 e ~ 189 



D 



Notice that our proof still goes through even if we choose a A that is larger than 5m, (we 
can take A = 2™). Therefore A can be represented using m bits. We can now prove the main 
theorem, 
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Theorem 4. Given a graph G = {V,E) with \V\ = n and \E\ = ni, there exists a randomized 
algorithm that runs in 0{mlog n) time and finds a maximum matching with high probability. 

Proof. Each step of the algorithm runs in 0(1) time (we just need to maintain an array which 
indicates whether the i vertex is occupied in the matching or not), thus one call of Algorithm 
[T]runs in 0(?TT,logn) time which by Theorem [3] lands on a maximum matching with probability 
^. Thus calling it 10 log n times independently, ensures that we land on a maximum matching 

in one call is at least 1 — (jH) °^" = 1 mj)- ^ 

7 Lower Bound via Conductance 

The conductance method as defined in the Section is used to obtain lower bounds on the 
mixing time of the a Markov chain. To get a lower bound on T^eiax we need an upper-bound 
on (f), and by definition of 0, for any cut {S,S). 



This allows us to observe: 



Lemma 4. For any graph G, the conductance of our Markov Chain satisfies 

where k is the size of maximum matching. 

Proof. To give an upper bound on conductance we need to construct a cut {S, S) for which 
we can estimate the above quantity. Let S be consisting of exactly one matching which is the 
maximum matching ruk where k is the size of the maximum matching. Thus the number of 
edges going out of S is k. 

SiG5jG5 '^i^ij _ ~Z" ^' m(A+l) _ k 1 



iE^es^^)iE^es^i) f (l-f) (1 + A)m (^i_^ 
Also using the terminology of Theorem [3] 

A'^ _ A'= _ 1 ^ 1 _ A^(A-l) _._r^(l 

X 



Z Eto^^A^ Eto5^-F"EtoF ^'""'-1 VA 



Therefore, 

EigS,iG5 '^i^ij _ k 1 ^ f k 



Thus the result follows. D 

As a result of the previous lemma we have the following result. 
Lemma 5. For any graph G, the mixing time of our Markov chain satisfies, Tmix > ^ (x) 
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Proof. Follows from results in Section 14.31 D 

Thus the lower bound is sharp (upto logarithmic factors) if the size of the matching in the 
given graph is small (say at most O(polylogn)). 

A Note on Other methods to prove Lower Bound: There are other methods, apart 
from conductance, which can be used to prove lower bounds on mixing time. Although powerful 
and useful in many contexts, it is not clear whether such methods could be applied to our chain. 
For eg. the Wilson's method [1] expects the knowledge of one eigenvector of the transition 
vector that is different from the all I's vector and the corresponding eigenvalue lies in the 
range (O, 2)- Since the matrix P for our case is an exponential sized matrix with apparently 
no useful pattern in the entries, it is not clear how to come up with such an eigenvector. In 
fact, we made several educated guesses for coming up with such a vector all of which failed to 
serve our purpose. 

8 Conclusion 

In this paper, we gave a new randomized algorithm for finding maximum matchings in general 
bipartite graphs that runs in 0(m log n) time that improves upon the running time of Micali 
and Vazirani. Our algorithm was based on the MCMC paradigm which performs a truncated 
random walk on the Markov Chain defined by the Glauber Dynamics with parameter A. Apart 
from the benefit of being very simple (both in analysis and implementation) our algorithm is 
the first near linear time complexity algorithm for the maximum matching problem for general 
graphs. Moreover, unlike [15] the running time of our algorithm is not a random variable. 

To our knowledge this is for the first time Glauber dynamics and the nature of Gibbs 
distribution has been exploited to design an faster algorithm for a problem for which efficient 
solutions are already known, and we hope this idea can be of use in other problems as well. 
The obvious open problem will be to improve both the upper-bounds and lower bounds on the 
mixing time. Is it possible to improve upon the present bound to get an 0{m) time algorithm? 
Also one can explore the possibility of proving a tighter lower bound of ^{m) by using more 
refined techniques. More specifically, it would be interesting to see if one can obtain an explicit 
eigenvector of the transition matrix P (that is different from all I's vector) and apply Wilson's 
to get an improved lower bound on the mixing time. 
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